Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 98913 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 13.2 MiB |
| Average record size in memory | 140.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 5 |
| Boolean | 4 |
Warnings
| Message | Type |
|---|---|
| countryCode has a high cardinality: 199 distinct values | High cardinality |
| seniority is highly correlated with seniorityAsYears | High correlation |
| seniorityAsYears is highly correlated with seniority | High correlation |
| gender is highly correlated with civilityGenderId and 1 other field | High correlation |
| civilityGenderId is highly correlated with gender and 1 other field | High correlation |
| civilityTitle is highly correlated with gender and 1 other field | High correlation |
| socialNbFollowers is highly skewed (γ1 = 88.81691016) | Skewed |
| socialNbFollows is highly skewed (γ1 = 220.8766787) | Skewed |
| socialProductsLiked is highly skewed (γ1 = 244.1577429) | Skewed |
| productsListed is highly skewed (γ1 = 64.89321853) | Skewed |
| productsSold is highly skewed (γ1 = 41.59563253) | Skewed |
| productsWished is highly skewed (γ1 = 49.25695941) | Skewed |
| productsBought is highly skewed (γ1 = 84.79735987) | Skewed |
| identifierHash has unique values | Unique |
| socialProductsLiked has 82987 (83.9%) zeros | Zeros |
| productsListed has 97189 (98.3%) zeros | Zeros |
| productsSold has 96877 (97.9%) zeros | Zeros |
| productsPassRate has 97979 (99.1%) zeros | Zeros |
| productsWished has 89612 (90.6%) zeros | Zeros |
| productsBought has 93494 (94.5%) zeros | Zeros |
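As a rough illustration of how the "Skewed" and "Zeros" figures above could be recomputed, the sketch below derives a skewness coefficient and the share of zeros for a small made-up column. The function names and the sample values are hypothetical, and the bias-adjusted estimator used here (the adjusted Fisher-Pearson coefficient) is an assumption about what the report computes, not something the report states.

```python
import math


def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness (assumed estimator for γ1)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three observations")
    mean = sum(values) / n
    # Second and third central moments (population form).
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    if m2 == 0:
        return 0.0  # a constant column has no asymmetry
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


def zero_share(values):
    """Fraction of entries that are exactly zero."""
    return sum(1 for v in values if v == 0) / len(values)


# Hypothetical count column: mostly zeros with one large value.
counts = [0, 0, 0, 0, 0, 0, 0, 1, 2, 40]
print(zero_share(counts))       # share of zeros in the toy column
print(sample_skewness(counts))  # large positive value: long right tail
```

A column dominated by zeros with a few very large counts, like the product and social columns in this report, yields a large positive skewness under this estimator.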
Reproduction
| Analysis started | 2021-02-15 18:14:23.029439 |
|---|---|
| Analysis finished | 2021-02-15 18:16:22.326457 |
| Duration | 1 minute and 59.3 seconds |
| Software version | pandas-profiling v2.10.1 |
| Download configuration | config.yaml |
identifierHash
Real number (ℝ)
| Distinct | 98913 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -6.692038995 × 10¹⁵ |
|---|---|
| Minimum | -9.223101126 × 10¹⁸ |
| Maximum | 9.223330728 × 10¹⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | -9.223101126 × 10¹⁸ |
|---|---|
| 5-th percentile | -8.30032878 × 10¹⁸ |
| Q1 | -4.622894617 × 10¹⁸ |
| median | -1.337988846 × 10¹⁵ |
| Q3 | 4.616388118 × 10¹⁸ |
| 95-th percentile | 8.305984346 × 10¹⁸ |
| Maximum | 9.223330728 × 10¹⁸ |
| Range | -3.122194426 × 10¹⁴ |
| Interquartile range (IQR) | 9.239282735 × 10¹⁸ |
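The negative Range reported above is what one would expect if Maximum minus Minimum were evaluated in signed 64-bit integer arithmetic: the true difference (about 1.84 × 10¹⁹) exceeds the largest int64 value (about 9.22 × 10¹⁸). The sketch below emulates that wraparound in plain Python. The helper name `wrap_int64` is hypothetical, the endpoints are the rounded figures from the table, and attributing the reported value to overflow is an inference rather than something the report states.

```python
def wrap_int64(value):
    """Reduce a Python int to the signed 64-bit range (two's complement)."""
    return (value + 2**63) % 2**64 - 2**63


# Rounded endpoints taken from the quantile table above.
minimum = -9_223_101_126 * 10**9
maximum = 9_223_330_728 * 10**9

true_range = maximum - minimum          # exact, arbitrary-precision result
wrapped_range = wrap_int64(true_range)  # what 64-bit arithmetic would give

print(true_range)
print(wrapped_range)
```

With these rounded inputs the wrapped result is close to -3.122 × 10¹⁴, which agrees with the reported Range to the precision the rounding allows.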
Descriptive statistics
| Standard deviation | 5.33080688 × 10¹⁸ |
|---|---|
| Coefficient of variation (CV) | -796.5893332 |
| Kurtosis | -1.201867217 |
| Mean | -6.692038995 × 10¹⁵ |
| Median Absolute Deviation (MAD) | 4.619299754 × 10¹⁸ |
| Skewness | 0.001133563788 |
| Sum | 2.153133565 × 10¹⁸ |
| Variance | 2.8417502 × 10³⁷ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.86001103 × 10¹⁸ | 1 | < 0.1% |
| -8.302234935 × 10¹⁸ | 1 | < 0.1% |
| -7.914149281 × 10¹⁸ | 1 | < 0.1% |
| 6.582480496 × 10¹⁸ | 1 | < 0.1% |
| -8.950562328 × 10¹⁸ | 1 | < 0.1% |
| -8.021904012 × 10¹⁸ | 1 | < 0.1% |
| -4.962304304 × 10¹⁸ | 1 | < 0.1% |
| -4.149872466 × 10¹⁸ | 1 | < 0.1% |
| 2.092426645 × 10¹⁸ | 1 | < 0.1% |
| 6.002771697 × 10¹⁸ | 1 | < 0.1% |
| Other values (98903) | 98903 | > 99.9% |
| Value | Count | Frequency (%) |
| -9.223101126 × 10¹⁸ | 1 | < 0.1% |
| -9.223057731 × 10¹⁸ | 1 | < 0.1% |
| -9.222867488 × 10¹⁸ | 1 | < 0.1% |
| -9.222666406 × 10¹⁸ | 1 | < 0.1% |
| -9.222346324 × 10¹⁸ | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.223330728 × 10¹⁸ | 1 | < 0.1% |
| 9.223304665 × 10¹⁸ | 1 | < 0.1% |
| 9.222858252 × 10¹⁸ | 1 | < 0.1% |
| 9.222779374 × 10¹⁸ | 1 | < 0.1% |
| 9.222469665 × 10¹⁸ | 1 | < 0.1% |
language
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 772.9 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 197826 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | fr |
| 4th row | en |
| 5th row | en |
| Value | Count | Frequency (%) |
| en | 51564 | 52.1% |
| fr | 26372 | 26.7% |
| it | 7766 | 7.9% |
| de | 7178 | 7.3% |
| es | 6033 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 64775 | |
| n | 51564 | |
| f | 26372 | |
| r | 26372 | |
| i | 7766 | 3.9% |
| t | 7766 | 3.9% |
| d | 7178 | 3.6% |
| s | 6033 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 197826 |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 64775 | |
| n | 51564 | |
| f | 26372 | |
| r | 26372 | |
| i | 7766 | 3.9% |
| t | 7766 | 3.9% |
| d | 7178 | 3.6% |
| s | 6033 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 197826 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 64775 | |
| n | 51564 | |
| f | 26372 | |
| r | 26372 | |
| i | 7766 | 3.9% |
| t | 7766 | 3.9% |
| d | 7178 | 3.6% |
| s | 6033 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 197826 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 64775 | |
| n | 51564 | |
| f | 26372 | |
| r | 26372 | |
| i | 7766 | 3.9% |
| t | 7766 | 3.9% |
| d | 7178 | 3.6% |
| s | 6033 | 3.0% |
socialNbFollowers
Real number (ℝ≥0)
| Distinct | 90 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.432268761 |
|---|---|
| Minimum | 3 |
| Maximum | 744 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 3 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 744 |
| Range | 741 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.882383028 |
|---|---|
| Coefficient of variation (CV) | 1.131141906 |
| Kurtosis | 14415.30703 |
| Mean | 3.432268761 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 88.81691016 |
| Sum | 339496 |
| Variance | 15.07289798 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 84939 | 85.9% |
| 4 | 8219 | 8.3% |
| 5 | 2720 | 2.7% |
| 6 | 813 | 0.8% |
| 7 | 539 | 0.5% |
| 8 | 336 | 0.3% |
| 9 | 235 | 0.2% |
| 10 | 164 | 0.2% |
| 11 | 121 | 0.1% |
| 12 | 99 | 0.1% |
| Other values (80) | 728 | 0.7% |
| Value | Count | Frequency (%) |
| 3 | 84939 | 85.9% |
| 4 | 8219 | 8.3% |
| 5 | 2720 | 2.7% |
| 6 | 813 | 0.8% |
| 7 | 539 | 0.5% |
| Value | Count | Frequency (%) |
| 744 | 1 | < 0.1% |
| 353 | 1 | < 0.1% |
| 205 | 1 | < 0.1% |
| 176 | 1 | < 0.1% |
| 172 | 1 | < 0.1% |
socialNbFollows
Real number (ℝ≥0)
| Distinct | 85 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.42567711 |
|---|---|
| Minimum | 0 |
| Maximum | 13764 |
| Zeros | 39 |
| Zeros (%) | < 0.1% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 8 |
| median | 8 |
| Q3 | 8 |
| 95-th percentile | 8 |
| Maximum | 13764 |
| Range | 13764 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 52.83957192 |
|---|---|
| Coefficient of variation (CV) | 6.271255262 |
| Kurtosis | 52718.3891 |
| Mean | 8.42567711 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 220.8766787 |
| Sum | 833409 |
| Variance | 2792.02036 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 94893 | 95.9% |
| 9 | 2386 | 2.4% |
| 10 | 618 | 0.6% |
| 11 | 260 | 0.3% |
| 12 | 148 | 0.1% |
| 13 | 94 | 0.1% |
| 15 | 55 | 0.1% |
| 14 | 53 | 0.1% |
| 7 | 52 | 0.1% |
| 0 | 39 | < 0.1% |
| Other values (75) | 315 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 39 | < 0.1% |
| 1 | 5 | < 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 13764 | 1 | < 0.1% |
| 8268 | 1 | < 0.1% |
| 3649 | 1 | < 0.1% |
| 2013 | 1 | < 0.1% |
| 500 | 1 | < 0.1% |
socialProductsLiked
Real number (ℝ≥0)
| Distinct | 420 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.420743482 |
|---|---|
| Minimum | 0 |
| Maximum | 51671 |
| Zeros | 82987 |
| Zeros (%) | 83.9% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 8 |
| Maximum | 51671 |
| Range | 51671 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 181.0305695 |
|---|---|
| Coefficient of variation (CV) | 40.95025423 |
| Kurtosis | 67765.24122 |
| Mean | 4.420743482 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 244.1577429 |
| Sum | 437269 |
| Variance | 32772.06708 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 82987 | 83.9% |
| 1 | 5261 | 5.3% |
| 2 | 1898 | 1.9% |
| 3 | 1215 | 1.2% |
| 4 | 973 | 1.0% |
| 5 | 644 | 0.7% |
| 6 | 532 | 0.5% |
| 7 | 436 | 0.4% |
| 8 | 359 | 0.4% |
| 9 | 316 | 0.3% |
| Other values (410) | 4292 | 4.3% |
| Value | Count | Frequency (%) |
| 0 | 82987 | 83.9% |
| 1 | 5261 | 5.3% |
| 2 | 1898 | 1.9% |
| 3 | 1215 | 1.2% |
| 4 | 973 | 1.0% |
| Value | Count | Frequency (%) |
| 51671 | 1 | < 0.1% |
| 16040 | 1 | < 0.1% |
| 7044 | 1 | < 0.1% |
| 5979 | 1 | < 0.1% |
| 5598 | 1 | < 0.1% |
productsListed
Real number (ℝ≥0)
| Distinct | 65 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09330421684 |
|---|---|
| Minimum | 0 |
| Maximum | 244 |
| Zeros | 97189 |
| Zeros (%) | 98.3% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 244 |
| Range | 244 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.050143546 |
|---|---|
| Coefficient of variation (CV) | 21.97267835 |
| Kurtosis | 5760.301256 |
| Mean | 0.09330421684 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 64.89321853 |
| Sum | 9229 |
| Variance | 4.203088557 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 97189 | 98.3% |
| 1 | 808 | 0.8% |
| 2 | 278 | 0.3% |
| 3 | 150 | 0.2% |
| 4 | 98 | 0.1% |
| 5 | 62 | 0.1% |
| 6 | 45 | < 0.1% |
| 7 | 40 | < 0.1% |
| 8 | 29 | < 0.1% |
| 10 | 22 | < 0.1% |
| Other values (55) | 192 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 97189 | 98.3% |
| 1 | 808 | 0.8% |
| 2 | 278 | 0.3% |
| 3 | 150 | 0.2% |
| 4 | 98 | 0.1% |
| Value | Count | Frequency (%) |
| 244 | 1 | < 0.1% |
| 217 | 1 | < 0.1% |
| 202 | 1 | < 0.1% |
| 185 | 1 | < 0.1% |
| 123 | 1 | < 0.1% |
productsSold
Real number (ℝ≥0)
| Distinct | 75 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1215917018 |
|---|---|
| Minimum | 0 |
| Maximum | 174 |
| Zeros | 96877 |
| Zeros (%) | 97.9% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 174 |
| Range | 174 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.126895354 |
|---|---|
| Coefficient of variation (CV) | 17.49210943 |
| Kurtosis | 2355.673441 |
| Mean | 0.1215917018 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 41.59563253 |
| Sum | 12027 |
| Variance | 4.523683846 |
| Monotonicity | Decreasing |
| Value | Count | Frequency (%) |
| 0 | 96877 | 97.9% |
| 1 | 917 | 0.9% |
| 2 | 325 | 0.3% |
| 3 | 154 | 0.2% |
| 4 | 124 | 0.1% |
| 5 | 58 | 0.1% |
| 6 | 58 | 0.1% |
| 7 | 45 | < 0.1% |
| 9 | 42 | < 0.1% |
| 8 | 31 | < 0.1% |
| Other values (65) | 282 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 96877 | 97.9% |
| 1 | 917 | 0.9% |
| 2 | 325 | 0.3% |
| 3 | 154 | 0.2% |
| 4 | 124 | 0.1% |
| Value | Count | Frequency (%) |
| 174 | 1 | < 0.1% |
| 170 | 1 | < 0.1% |
| 163 | 1 | < 0.1% |
| 152 | 1 | < 0.1% |
| 125 | 1 | < 0.1% |
productsPassRate
Real number (ℝ≥0)
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8123027307 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 97979 |
| Zeros (%) | 99.1% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 8.500205194 |
|---|---|
| Coefficient of variation (CV) | 10.46433167 |
| Kurtosis | 114.0391218 |
| Mean | 0.8123027307 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.66729865 |
| Sum | 80347.3 |
| Variance | 72.25348834 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 97979 | 99.1% |
| 100 | 441 | 0.4% |
| 66 | 63 | 0.1% |
| 50 | 57 | 0.1% |
| 75 | 42 | < 0.1% |
| 83 | 25 | < 0.1% |
| 90 | 25 | < 0.1% |
| 80 | 22 | < 0.1% |
| 85 | 20 | < 0.1% |
| 60 | 16 | < 0.1% |
| Other values (62) | 223 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 97979 | 99.1% |
| 25 | 5 | < 0.1% |
| 28 | 2 | < 0.1% |
| 31 | 1 | < 0.1% |
| 33 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 441 | 0.4% |
| 99 | 1 | < 0.1% |
| 98.7 | 1 | < 0.1% |
| 98 | 8 | < 0.1% |
| 96.4 | 1 | < 0.1% |
productsWished
Real number (ℝ≥0)
| Distinct | 279 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.562595412 |
|---|---|
| Minimum | 0 |
| Maximum | 2635 |
| Zeros | 89612 |
| Zeros (%) | 90.6% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 2635 |
| Range | 2635 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 25.19279323 |
|---|---|
| Coefficient of variation (CV) | 16.12240317 |
| Kurtosis | 3369.163069 |
| Mean | 1.562595412 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 49.25695941 |
| Sum | 154561 |
| Variance | 634.6768308 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 89612 | 90.6% |
| 1 | 3375 | 3.4% |
| 2 | 1339 | 1.4% |
| 3 | 797 | 0.8% |
| 4 | 526 | 0.5% |
| 5 | 406 | 0.4% |
| 6 | 299 | 0.3% |
| 7 | 252 | 0.3% |
| 8 | 176 | 0.2% |
| 9 | 158 | 0.2% |
| Other values (269) | 1973 | 2.0% |
| Value | Count | Frequency (%) |
| 0 | 89612 | 90.6% |
| 1 | 3375 | 3.4% |
| 2 | 1339 | 1.4% |
| 3 | 797 | 0.8% |
| 4 | 526 | 0.5% |
| Value | Count | Frequency (%) |
| 2635 | 1 | < 0.1% |
| 1916 | 1 | < 0.1% |
| 1900 | 1 | < 0.1% |
| 1842 | 1 | < 0.1% |
| 1820 | 1 | < 0.1% |
productsBought
Real number (ℝ≥0)
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1719288668 |
|---|---|
| Minimum | 0 |
| Maximum | 405 |
| Zeros | 93494 |
| Zeros (%) | 94.5% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 405 |
| Range | 405 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.332265666 |
|---|---|
| Coefficient of variation (CV) | 13.56529424 |
| Kurtosis | 11871.75975 |
| Mean | 0.1719288668 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 84.79735987 |
| Sum | 17006 |
| Variance | 5.439463136 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 93494 | 94.5% |
| 1 | 3297 | 3.3% |
| 2 | 845 | 0.9% |
| 3 | 364 | 0.4% |
| 4 | 214 | 0.2% |
| 5 | 139 | 0.1% |
| 6 | 108 | 0.1% |
| 7 | 65 | 0.1% |
| 8 | 52 | 0.1% |
| 9 | 40 | < 0.1% |
| Other values (60) | 295 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 93494 | 94.5% |
| 1 | 3297 | 3.3% |
| 2 | 845 | 0.9% |
| 3 | 364 | 0.4% |
| 4 | 214 | 0.2% |
| Value | Count | Frequency (%) |
| 405 | 1 | < 0.1% |
| 279 | 1 | < 0.1% |
| 174 | 1 | < 0.1% |
| 115 | 1 | < 0.1% |
| 105 | 1 | < 0.1% |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 772.9 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 98913 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | F |
| Value | Count | Frequency (%) |
| F | 76121 | 77.0% |
| M | 22792 | 23.0% |
| Value | Count | Frequency (%) |
| f | 76121 | 77.0% |
| m | 22792 | 23.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 76121 | |
| M | 22792 | 23.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 98913 |
Most frequent character per category
| Value | Count | Frequency (%) |
| F | 76121 | |
| M | 22792 | 23.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 98913 |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 76121 | |
| M | 22792 | 23.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 98913 |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 76121 | |
| M | 22792 | 23.0% |
civilityGenderId
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 772.9 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 98913 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
| Value | Count | Frequency (%) |
| 2 | 75684 | 76.5% |
| 1 | 22792 | 23.0% |
| 3 | 437 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 75684 | |
| 1 | 22792 | 23.0% |
| 3 | 437 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 98913 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 75684 | |
| 1 | 22792 | 23.0% |
| 3 | 437 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 98913 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 75684 | |
| 1 | 22792 | 23.0% |
| 3 | 437 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 98913 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 75684 | |
| 1 | 22792 | 23.0% |
| 3 | 437 | 0.4% |
civilityTitle
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 772.9 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.773993307 |
| Min length | 2 |
Characters and Unicode
| Total characters | 274384 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | mr |
|---|---|
| 2nd row | mrs |
| 3rd row | mrs |
| 4th row | mrs |
| 5th row | mrs |
| Value | Count | Frequency (%) |
| mrs | 75684 | 76.5% |
| mr | 22792 | 23.0% |
| miss | 437 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 98913 | |
| r | 98476 | |
| s | 76558 | |
| i | 437 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 274384 |
Most frequent character per category
| Value | Count | Frequency (%) |
| m | 98913 | |
| r | 98476 | |
| s | 76558 | |
| i | 437 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 274384 |
Most frequent character per script
| Value | Count | Frequency (%) |
| m | 98913 | |
| r | 98476 | |
| s | 76558 | |
| i | 437 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 274384 |
Most frequent character per block
| Value | Count | Frequency (%) |
| m | 98913 | |
| r | 98476 | |
| s | 76558 | |
| i | 437 | 0.2% |
hasAnyApp
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 96.7 KiB |
| Value | Count | Frequency (%) |
| False | 72739 | 73.5% |
| True | 26174 | 26.5% |
hasAndroidApp
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 96.7 KiB |
| Value | Count | Frequency (%) |
| False | 94094 | 95.1% |
| True | 4819 | 4.9% |
hasIosApp
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 96.7 KiB |
| Value | Count | Frequency (%) |
| False | 77386 | 78.2% |
| True | 21527 | 21.8% |
hasProfilePicture
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 96.7 KiB |
| Value | Count | Frequency (%) |
| True | 97018 | 98.1% |
| False | 1895 | 1.9% |
daysSinceLastLogin
Real number (ℝ≥0)
| Distinct | 699 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 581.2912357 |
|---|---|
| Minimum | 11 |
| Maximum | 709 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 43 |
| Q1 | 572 |
| median | 694 |
| Q3 | 702 |
| 95-th percentile | 708 |
| Maximum | 709 |
| Range | 698 |
| Interquartile range (IQR) | 130 |
Descriptive statistics
| Standard deviation | 208.8558881 |
|---|---|
| Coefficient of variation (CV) | 0.3592964684 |
| Kurtosis | 1.388704906 |
| Mean | 581.2912357 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -1.675425192 |
| Sum | 57497260 |
| Variance | 43620.782 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 702 | 3838 | 3.9% |
| 703 | 3792 | 3.8% |
| 695 | 3677 | 3.7% |
| 696 | 3565 | 3.6% |
| 701 | 3516 | 3.6% |
| 700 | 3397 | 3.4% |
| 693 | 3384 | 3.4% |
| 694 | 3368 | 3.4% |
| 705 | 3328 | 3.4% |
| 704 | 3284 | 3.3% |
| Other values (689) | 63764 | 64.5% |
| Value | Count | Frequency (%) |
| 11 | 811 | 0.8% |
| 12 | 409 | 0.4% |
| 13 | 344 | 0.3% |
| 14 | 311 | 0.3% |
| 15 | 235 | 0.2% |
| Value | Count | Frequency (%) |
| 709 | 2910 | 2.9% |
| 708 | 2857 | 2.9% |
| 707 | 2797 | 2.8% |
| 706 | 2643 | 2.7% |
| 705 | 3328 | 3.4% |
seniority
Real number (ℝ≥0)
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3063.77187 |
|---|---|
| Minimum | 2852 |
| Maximum | 3205 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 2852 |
|---|---|
| 5-th percentile | 2853 |
| Q1 | 2857 |
| median | 3196 |
| Q3 | 3201 |
| 95-th percentile | 3205 |
| Maximum | 3205 |
| Range | 353 |
| Interquartile range (IQR) | 344 |
Descriptive statistics
| Standard deviation | 168.2986205 |
|---|---|
| Coefficient of variation (CV) | 0.05493183815 |
| Kurtosis | -1.816504427 |
| Mean | 3063.77187 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.4270896795 |
| Sum | 303046867 |
| Variance | 28324.42566 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3199 | 6366 | 6.4% |
| 3198 | 6126 | 6.2% |
| 2857 | 5984 | 6.0% |
| 2856 | 5945 | 6.0% |
| 3197 | 5686 | 5.7% |
| 3196 | 5577 | 5.6% |
| 3200 | 5496 | 5.6% |
| 3201 | 5487 | 5.5% |
| 3205 | 5310 | 5.4% |
| 2855 | 5267 | 5.3% |
| Other values (9) | 41669 | 42.1% |
| Value | Count | Frequency (%) |
| 2852 | 2506 | 2.5% |
| 2853 | 4824 | 4.9% |
| 2854 | 5192 | 5.2% |
| 2855 | 5267 | 5.3% |
| 2856 | 5945 | 6.0% |
| Value | Count | Frequency (%) |
| 3205 | 5310 | 5.4% |
| 3204 | 5070 | 5.1% |
| 3203 | 4921 | 5.0% |
| 3202 | 4622 | 4.7% |
| 3201 | 5487 | 5.5% |
seniorityAsYears
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.510424413 |
|---|---|
| Minimum | 7.92 |
| Maximum | 8.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 772.9 KiB |
Quantile statistics
| Minimum | 7.92 |
|---|---|
| 5-th percentile | 7.92 |
| Q1 | 7.94 |
| median | 8.88 |
| Q3 | 8.89 |
| 95-th percentile | 8.9 |
| Maximum | 8.9 |
| Range | 0.98 |
| Interquartile range (IQR) | 0.95 |
Descriptive statistics
| Standard deviation | 0.4678629516 |
|---|---|
| Coefficient of variation (CV) | 0.05497527842 |
| Kurtosis | -1.816316678 |
| Mean | 8.510424413 |
| Median Absolute Deviation (MAD) | 0.02 |
| Skewness | -0.4273113469 |
| Sum | 841791.61 |
| Variance | 0.2188957415 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.88 | 22523 | 22.8% |
| 8.89 | 21971 | 22.2% |
| 7.93 | 16404 | 16.6% |
| 7.94 | 15384 | 15.6% |
| 8.9 | 15301 | 15.5% |
| 7.92 | 7330 | 7.4% |
| Value | Count | Frequency (%) |
| 7.92 | 7330 | 7.4% |
| 7.93 | 16404 | 16.6% |
| 7.94 | 15384 | 15.6% |
| 8.88 | 22523 | 22.8% |
| 8.89 | 21971 | 22.2% |
| Value | Count | Frequency (%) |
| 8.9 | 15301 | 15.5% |
| 8.89 | 21971 | 22.2% |
| 8.88 | 22523 | 22.8% |
| 7.94 | 15384 | 15.6% |
| 7.93 | 16404 | 16.6% |
countryCode
Categorical
| Distinct | 199 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 772.9 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 197826 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 31 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | gb |
|---|---|
| 2nd row | mc |
| 3rd row | fr |
| 4th row | us |
| 5th row | us |
| Value | Count | Frequency (%) |
| fr | 25135 | 25.4% |
| us | 20602 | 20.8% |
| gb | 11310 | 11.4% |
| it | 8015 | 8.1% |
| de | 6567 | 6.6% |
| es | 5706 | 5.8% |
| au | 2719 | 2.7% |
| dk | 1892 | 1.9% |
| se | 1826 | 1.8% |
| be | 1666 | 1.7% |
| Other values (189) | 13475 | 13.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 28735 | |
| r | 26825 | |
| f | 25826 | |
| u | 24113 | |
| e | 16593 | |
| b | 13298 | |
| g | 12103 | |
| i | 9586 | 4.8% |
| t | 9352 | 4.7% |
| d | 8670 | 4.4% |
| Other values (16) | 22725 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 197826 |
Most frequent character per category
| Value | Count | Frequency (%) |
| s | 28735 | |
| r | 26825 | |
| f | 25826 | |
| u | 24113 | |
| e | 16593 | |
| b | 13298 | |
| g | 12103 | |
| i | 9586 | 4.8% |
| t | 9352 | 4.7% |
| d | 8670 | 4.4% |
| Other values (16) | 22725 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 197826 |
Most frequent character per script
| Value | Count | Frequency (%) |
| s | 28735 | |
| r | 26825 | |
| f | 25826 | |
| u | 24113 | |
| e | 16593 | |
| b | 13298 | |
| g | 12103 | |
| i | 9586 | 4.8% |
| t | 9352 | 4.7% |
| d | 8670 | 4.4% |
| Other values (16) | 22725 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 197826 |
Most frequent character per block
| Value | Count | Frequency (%) |
| s | 28735 | |
| r | 26825 | |
| f | 25826 | |
| u | 24113 | |
| e | 16593 | |
| b | 13298 | |
| g | 12103 | |
| i | 9586 | 4.8% |
| t | 9352 | 4.7% |
| d | 8670 | 4.4% |
| Other values (16) | 22725 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
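The calculation described above can be sketched in a few lines of plain Python. This is a minimal illustration with made-up data, not the code the report itself runs; the function name `pearson_r` and the sample lists are assumptions.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/(n-1) factors of covariance and variances cancel, so sums suffice.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined for a constant variable")
    return cov / math.sqrt(ss_x * ss_y)


x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
r = pearson_r(x, y)

# Shifting and rescaling either variable leaves r unchanged.
r_rescaled = pearson_r([10 * a - 3 for a in x], [0.5 * b + 100 for b in y])
print(r, r_rescaled)
```

The second call illustrates the invariance under changes in location and scale mentioned in the text.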
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
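As a hedged sketch of that recipe, the code below ranks both variables (ties share the average rank, which is an assumption about tie handling) and then applies the Pearson formula to the ranks. All names and data are illustrative only.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of positions start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson_r(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship (y = x cubed).
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
print(spearman_rho(x, y), pearson_r(x, y))
```

For this cubic relationship the rank-based coefficient reaches 1 while Pearson's r stays below 1, which is the contrast the paragraph describes.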
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
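The pair-counting definition above corresponds to the variant usually called tau-a. The sketch below implements exactly that definition on made-up data; the function name is hypothetical, and library implementations often default to a tie-corrected variant (tau-b), so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, as described in the text."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    pairs = 0
    for (x1, y1), (x2, y2) in combinations(list(zip(x, y)), 2):
        pairs += 1
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
        # sign == 0 means a tie in x or y: counted in pairs only
    return (concordant - discordant) / pairs


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)  # 5 concordant pairs, 1 discordant, 6 pairs in total
print(tau)
```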
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | identifierHash | language | socialNbFollowers | socialNbFollows | socialProductsLiked | productsListed | productsSold | productsPassRate | productsWished | productsBought | gender | civilityGenderId | civilityTitle | hasAnyApp | hasAndroidApp | hasIosApp | hasProfilePicture | daysSinceLastLogin | seniority | seniorityAsYears | countryCode |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -1097895247965112460 | en | 147 | 10 | 77 | 26 | 174 | 74.0 | 104 | 1 | M | 1 | mr | True | False | True | True | 11 | 3196 | 8.88 | gb |
| 1 | 2347567364561867620 | en | 167 | 8 | 2 | 19 | 170 | 99.0 | 0 | 0 | F | 2 | mrs | True | False | True | True | 12 | 3204 | 8.90 | mc |
| 2 | 6870940546848049750 | fr | 137 | 13 | 60 | 33 | 163 | 94.0 | 10 | 3 | F | 2 | mrs | True | False | True | False | 11 | 3203 | 8.90 | fr |
| 3 | -4640272621319568052 | en | 131 | 10 | 14 | 122 | 152 | 92.0 | 7 | 0 | F | 2 | mrs | True | False | True | False | 12 | 3198 | 8.88 | us |
| 4 | -5175830994878542658 | en | 167 | 8 | 0 | 25 | 125 | 100.0 | 0 | 0 | F | 2 | mrs | False | False | False | True | 22 | 2854 | 7.93 | us |
| 5 | 7631788075812383072 | de | 130 | 12 | 1 | 47 | 123 | 91.0 | 0 | 0 | F | 2 | mrs | True | False | True | False | 11 | 3196 | 8.88 | de |
| 6 | 674361423306028463 | en | 121 | 0 | 1140 | 31 | 108 | 94.0 | 531 | 105 | F | 3 | miss | True | True | False | False | 11 | 3198 | 8.88 | se |
| 7 | 2550976450216757005 | fr | 53 | 9 | 3 | 5 | 106 | 98.0 | 0 | 0 | F | 2 | mrs | True | False | True | True | 11 | 2857 | 7.94 | fr |
| 8 | 3718185418791028367 | it | 744 | 13764 | 51671 | 0 | 104 | 85.0 | 1842 | 0 | F | 2 | mrs | True | False | True | False | 14 | 3195 | 8.88 | it |
| 9 | 3908244093584862523 | en | 57 | 8 | 45 | 123 | 92 | 74.0 | 6 | 2 | F | 3 | miss | True | False | True | True | 11 | 2856 | 7.93 | gb |
Last rows
| | identifierHash | language | socialNbFollowers | socialNbFollows | socialProductsLiked | productsListed | productsSold | productsPassRate | productsWished | productsBought | gender | civilityGenderId | civilityTitle | hasAnyApp | hasAndroidApp | hasIosApp | hasProfilePicture | daysSinceLastLogin | seniority | seniorityAsYears | countryCode |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 98903 | -2219367748414812248 | es | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | F | 2 | mrs | True | False | True | True | 112 | 3204 | 8.9 | es |
| 98904 | 2896867688384676348 | en | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | F | 2 | mrs | False | False | False | True | 708 | 3204 | 8.9 | gb |
| 98905 | 3164321379397826945 | en | 3 | 8 | 6 | 0 | 0 | 0.0 | 0 | 0 | F | 2 | mrs | False | False | False | True | 655 | 3204 | 8.9 | us |
| 98906 | -3379431417039360607 | en | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | F | 2 | mrs | False | False | False | True | 708 | 3204 | 8.9 | ie |
| 98907 | -5212100190867739388 | en | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | F | 2 | mrs | False | False | False | True | 708 | 3204 | 8.9 | us |
| 98908 | -5324380437900495747 | fr | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | M | 1 | mr | False | False | False | True | 708 | 3204 | 8.9 | us |
| 98909 | -5607668753771114442 | fr | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | M | 1 | mr | True | False | True | True | 695 | 3204 | 8.9 | fr |
| 98910 | 350630276238833248 | en | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | M | 1 | mr | True | True | False | True | 520 | 3204 | 8.9 | be |
| 98911 | 2006580738726207028 | it | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | F | 2 | mrs | False | False | False | True | 267 | 3204 | 8.9 | it |
| 98912 | -7621316584087253691 | fr | 3 | 8 | 0 | 0 | 0 | 0.0 | 0 | 0 | M | 1 | mr | True | False | True | True | 561 | 3204 | 8.9 | gn |